Model Selection

Multimodal Chatbot

# Multimodal Chatbot

Llava V1.5 7b M3

M3 is a multimodal model that allows explicit control of visual granularity at runtime and can serve as a metric for image/dataset complexity. It is fine-tuned from LLaMA/Vicuna.

ShareGPT4V-7B is an open-source multimodal chatbot model trained using GPT4-Vision-assisted data and LLaVA instruction fine-tuning data.

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase